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ABSTRACT 

The presence of massive, compact, quiescent galaxies at z > 2 presents a major challenge for theoretical 
models of galaxy formation and evolution. Using one of the deepest large public near-IR surveys to date, we 
investigate in detail the correlations between star formation and galaxy structural parameters (size, stellar mass, 
and surface density) from z = 2 to the present. At all redshifts, massive quiescent galaxies (i.e. those with little or 
no star formation) occupy the extreme high end of the surface density distribution and follow a tight mass-size 
correlation, while star-forming galaxies show a broad range of both densities and sizes. Conversely, galaxies 
with the highest surface densities comprise a nearly-homogeneous population with little or no ongoing star 
formation, while less dense galaxies exhibit high star-formation rates and varying levels of dust obscuration. 
Both the sizes and surface densities of quiescent galaxies evolve strongly from z = 2 - 0; we parameterize this 
evolution for both populations with simple power law functions and present best-fit parameters for comparison 
to future theoretical models. Higher-mass quiescent galaxies undergo faster structural evolution, consistent 
with previous results. Interestingly, star-forming galaxies' sizes and densities evolve at rates similar to those 
of quiescent galaxies. It is therefore possible that the same physical processes drive the structural evolution of 
both populations, suggesting that "dry mergers" may not be the sole culprit in this size evolution. 
Subject headings: cosmology: observations - galaxies: evolution- galaxies: high redshift- galaxies: structure 



1. INTRODUCTION 

Ample evidence now exists for the presence of massive 
galaxies at z > 2 with little or no ongoing star formation, 
suggesting that a non-negligible fraction of the local early- 
type galaxy population was effectively in place only a few 
Gyr after the big bang, and that massive red galaxies con- 
stitute a significant fraction of (or po ssibly even dorninate^ 
the stellar mass density at z ^ 2 (e.g. McCarthv et al. '2004"; 
Labbe et al. 2005; Daddi et al. 2005; Ki-iek et al. 2006, 2008a; 
Rudnick et al. 2006; Marchesi ni et al.ll2007l:iStutz et al.ll2008i: 
Toft et al. 2009). Although their masses and star forma- 
tion rates (SFRs) are nominally similar, it has since be- 
come clear that these quiescent galaxies at z > 1 have dra- 
matically different structures compared to local ellipticals: 
specifically, the high-redshift galaxies' sizes are much smaller 
(e.g. Daddi et al. 2005; Trujillo et al. 2006a; Zirm et al. 2007; 
loft et al. 2007; van Dok kum et al.ll2008l: ICimatti et al.ll2008 : 
Damianov et al. 2009). Their surface densities likewise 
evolve strongly with redshift, as does the "threshold" sur- 
face density abo ve which galaxies are predominantly quies- 
cent (iFranx et a l. 2008; Maier et al. 2009). These phenomena 
pose several new challenges for studies of galaxy formation. 
Indeed, if massive quiescent galaxies at z ^ 2 were almost 
universally compact and dense, why are similar objects in the 
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local univers e practically nonexistent (e.g. lTruiillo et al.l2009t : 
iTavlor et al. ''2009b)? 

Several mechanisms for this transformation have been pro- 
posed, including a scenario whereby local early-type galax- 
ies are built through a series of "dry" (i.e., gas-poor) merg- 
ers of these compact progenitors ( Khochfar & Silk 200q; 
iHopkins et alll2009bl; iFeldmann et al.l l2009). Such mergers, 
being largely non-dissipative, would over time tend to "puff 
up" compact galaxies. Some observational estimates of the 
major dry merger rate since z ~ 1 indicate that th ese are in- 
deed a factor in the buildup of massive galax ies jBell et all 
l2006t iBundv et alj|2009l; Ide Ravel et ai]|2009h . but the rem- 
nants of such major mergers would also have correspondingly 
larger masses and so it is unclear whether major dry merg- 
ers can entirely solve the size-mass discrep ancy (though they 
are hk ely to mitigate it to some degree; e.g. lvan der Wei et alj 
l2009h . Minor mergers may be a more important process - 
both from a simple virial argument (Naab et al. 2009) and 
because massive compact "cores" at high redshift may ac- 
crete material in their out skirts to form the massive ellipti- 
cals seen today (Bezanso n^t alj|2009l ; IHopkins et al.ll2009al ; 
Ivan Dokkum et al.,,201Q) . However, theoretical models have 
yet to converge on a definitive explanation - some proposed 
models predict galaxy sizes dramatically different from those 
observed (Jpung et al. 2009). while others match observed 
sizes at z = 2 and z = dKhochfar & Silkl2006i) but not at z 1 
dvan der Wei et al.ll200 8lK 

Whatever the underlying mechanism may be, observations 
of large samples of galaxies over a broad redshift range are 
crucial to adequately test current and upcoming models. How- 
ever, many of the aforementioned galaxy structure studies em- 
ploy HST-NICMOS, which provides accurate size measure- 
ments but can typically image only one target at a time and 
is thus inadequate for observing very large samples. Optical 
imagers are much larger, but at z ^ 2 near-IR data are more ro- 
bust for size determinations; these bands fall in the rest-frame 



2 



Williams et al. 



optical and therefore better trace the distribution of stellar 
mass than the rest-frame UV (observed optical). Furthermore, 
spectroscopy of quiescent galaxies (both to determine their 
redshifts and confirm their quiescence) requires large amounts 
of observing time due to the absence of strong emission lines; 
again, while this can be perf ormed f or a few bright galaxies at 
z ~ 2 with current facilities jKriek e t al. 2006), it is infeasible 
for the large samples of fainter objects needed to provide a 
comprehensive view of the high-redshift galaxy population. 

Fortunately, although spectroscopy and space-based imag- 
ing give the most precise picture of star formation and galactic 
structure for individual objects, the average properties of large 
samples can be accurat ely investigated with less "expensive" 
data. For example. both lTruiillo et al.l (l2006bl) and lPranx et al.l 
(1^08) successfully based their galaxy size measurements on 
ground-based imaging and photometric redshifts. Although 
there are inherently greater uncertainties on the size and mass 
of any individual object than with NICMOS imaging and 
spectroscopic redshifts, the average structural parameters of 
large populations can be accurately determined. Furthermore, 
even without spectroscopy there are several ways to determine 
which galaxies are quiescent and which are actively form- 
ing stars. One straightforward method is to identify galax- 
ies on the "red sequence" (using, e.g., the rest-frame U-V 
color), which at low redshift is primarily composed of qui- 
escent galaxies with strong 4000A breaks. However, star- 
forming galaxies containing large amounts of dust can have 
similarly red colors, so "red and dead" galaxy samples se- 
lected through this color cut are likely to be contaminated 
with substantial numbe rs of "red and dusty" starbursts with 
mcreasmg redshift (e.g. lWilliams et al.ll2 009). 

With photometric observations in a suitable set of filters, 
the shape of the broadband spectral energy distribution (SED) 
can be used to distinguish between the sharp 4000A break 
characteristic of old stellar populations and the more gradual 
reddening caused by dust absorption; the best-fit SFRs and 
dust column densities from SED modeling codes can then be 
used to define quiescent galaxy samples. Empirical selection 
techniques using m ultiple re st-frame colors, such as the one 
employed bv Williams et al.l (12009,) . are also effective at sepa- 
rating dusty from "dead" galaxies up to z = 2 (and even higher; 
I. Labbe et al., in preparation) with little dependence on input 
templates or models. Coupled with the large, deep multiwave- 
length surveys and improved photometric redshift techniques 
that have emerged in the past few years, these diagnostics pro- 
vide an unprecedented view of star formation and its relation 
to other physical properties in massive galaxy populations. 

Using the largest such publicly-available data set with suffi- 
cient depth, the UKIDSS Ultra-Deep Survey (and overlapping 
imaging from the Subaru-XMM Deep Survey and Spitzer 
Wide-Area Extragalactic Survey), we analyze the structural 
evolution of quiescent galaxies up to z = 2, specifically the 
correlations between star formation activity, galaxy size, and 
stellar mass surface density. First in ^we review the data 
and describe the size and mass measurements employed for 
this study. Next in ^we investigate how the specific star- 
formation rate (sSFR) is related to galaxy mass, size, and sur- 
face density, and in ^present quantitative constraints on the 
evolution of these structural properties for both star-forming 
and quiescent galaxies. Finally, the uniformity of the con- 
nections between star formation and structure is discussed 
in ^ along with some novel ways to interpret these correla- 
tions. AB magnitudes and a concordance cosmology {h = 0.7, 



Qm = 0.3, f^A = 0.7) are assumed throughout. 

2. DATA AND MEASUREMENTS 

The high-redshift galaxy sample analyzed herein is based 
on an updated versio n of the A"-sel ected galaxy catalog de- 
scribed in detail by Williams et al.l (i2009) ; a brief summary 
follows. This catalog comprises near-infrared J and K data 
taken from the UKIDS S Ultra-Deep Survey Data Release 1 
(UDS; Lawrence et"aLl |2007; Warren et al. 2007a), with over- 
lapping BRi'z' imaging from the Subaru-XMM Deep Sur- 
vey (SXDS: ,Sekiguchi et al. 2004). and 3.6/4.5/im data from 
the Spitzer Wide-area Inf rared Extragalactic (SWIRE) sur- 
vey ("Lons dale et al.ll2003l) . Using the positions and shapes 
of bright but unsaturated stars in the different bands, all mo- 
saics were astrometrically matched and convolved to consis- 
tent p oint-spread functions ( PSFs). Using the SExtractor soft- 
ware (|Bertin & Arnoutslll996h . "color" fluxes were measured 
in fix ed l-'75 ape rtures and total K fluxes from flexible ellip- 
tical (lKronlll 980) apertures.^ 

In addition to the original 'Wi lliams et al.l(l2009h catalog, we 
incorporate the //-band mosaic from the UKIDSS Data Re- 
lease 3 (to be described by S. J. Warren et al., in preparation) 
with PSF- and astrometric matching and flux measurements 
performed in the same manner as with the other bands. Photo- 
metric redshifts were calculated with t he H data includ ed us- 
ing the publicly-available code EAZY (iBrammer et al.l l2008). 
The new photometric redshifts differed very little from the 
original ones presented by (Williams et al. 2009), but inclu- 
sion of the H band should in principle improve both the Zphot 
values and stellar mass estimates at z > 2. At the adopted flux 
limit of K < 22 A, this catalog contains nearly 3x10"* galax- 
ies. 

Because of its depth and ~ 0.8 deg^ area, the UDS is well- 
suited for studies of large galaxy samples at z > 0.5; at lower 
redshifts, however, the comoving volume probed by the UDS 
is too small to provide meaningful samples (and may also be 
severely affected by cosmic variance). Thus, for comparison 
to the UDS we also include the same z ^ 0.06 SDSS sam- 
ple used by iFranx et al.l (12008[). To summariz e, this sam- 
ple was originally defined bv [kauffmann et akl (12003 ). and 
IFranx et aTl (120081) performed minor corrections in the mass- 
to-light ratios (accounting for the fact that the galaxy centers, 
where the spectroscopic fibers are placed, are typically red- 
der than the outskirts) as well as total flux corrections. The 
redshift range of z = 0.05-0.07 was chosen to avoid selec- 
tion effects on galaxies with high mass (at the low-z end) and 
small angular sizes (at higher redshift). 

2.1. Sizes 
2.1.1. Fitting 

Size measureme nts we re performed in a manner similar to 
fliat by Toft et aT (20071). We use the Galfit software pack- 
age (.Peng et aL.2002i) to fit lSersid ( [1968) models to all bright 
{K < 22.4) sources detected in the unconvolved (i.e., before 
PSF matching, to ensure the highest possible angular resolu- 
tion) UDS K image. This image had a typical seeing FWHM 
of 0"7. First a square postage stamp 21 pixels (4 "2) on a side 
was made around each galaxy to be fitted. Initial guesses for 
the effective radius r^, magnitude, ellipticity, and position an- 
gle were taken from the SExtractor catalog. The PSF used 

^ Available from 

|http : //www ■ strw . leidenuniv . nl/galaxyevolution/UDs] 
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Fig. 1 . — Top panel: Output effective radii of simulated galaxies as a func- 
tion of ,„. Larger point sizes represent brighter galaxies; for clarity only a 
random 10% of our simulated galaxies in these magnitude bins are plotted. 
Bottom panel: Fractional deviation {re.om — >'e.in)/i'e.i„ as a function of input 
radius. Fainter galaxies exhibit larger random uncertainties. Although the 
faintest, largest galaxies show a small systematic offset (10 — 20% at worst), 
these deviations are small compared to the random scatter and the measured 
trends with redshift discussed in 

by Galfit to deconvolve the galaxy images is the average PSF 
over the image (taken as the median of 300 bright, unsatu- 
rated stars; also used in the PSF-matching step described in 
Williams et al. 2009). The Sersic parameter n was allowed to 
vary between 1 and 4 and the effective radius between 0."01 
(effectively a point source) and 15". Since galaxies in gen- 
eral may exhibit color gradients, it is important to measure 
sizes at approximately the same rest-frame wavelength when 
comparing samples at different redshifts. The procedure de- 
scribed above was thus repeated for the unconvolved J and H 
images (both exhibiting 0"8 seeing), and we define the size of 
a galaxy as its circularized effective radius (r^. = ^/ab) at a rest- 
frame wavelength of 8000A, interpolated from the size mea- 
surements of the two adjacent bands. In other words, when 
rest-frame 8000A falls between two observed bands / and j, 
the interpolated size is 

r..8000 = re.i + '^fdZlll [8000( 1 + z) - AJ ( 1 ) 

Aj-Ai 

In practice the interpolated sizes are quite similar to those 
computed by, e.g., simply taking the size measured from the 
observed band closest to 8000A. However, the interpolation 
method smoothes out possible discretization effects due to the 
choice of the measurement filter, and also in principle helps 
to mitigate failed size measurements in a given band. 

2.1.2. Simulations 

Previous studies have shown that reliable galaxy sizes can 
be measured from ground-based data, provided the signal-to- 
noise ratio is sufficiently high and the PSF across the im- 
age is stable (e.g. Truiillo et al. 2006b; Franx et al. 2008). 
For instance, Truiillo et al. (2006b) use simulations to inves- 
tigate systematic effects, finding them to be minimal down 
to r, ^ 0.2-0.3X FWHM(PSF), though individual measure- 
ments exhibit a large degree of scatter. The effective radius is 
typically the most robust structural parameter measured; other 



variables defining the profile shape (e.g. Sersic index and axis 
ratio) are far more susceptible to systematic uncertainties. 

It is nonetheless instructive to investigate whether the spe- 
cific characteristics of the UDS data and our fitting techniques 
have introduced systematic biases; we thus performed simu- 
lations analogous to Truiillo et al. ( 2006b) to test our effective 
radius measurements. Model galaxy profiles over a range of 
magnitudes = 1 8 - 23, r^. from 0. 1 to 1 arcsec, and n = 1 - 4 
were created; these were chosen to span typical galaxy angu- 
lar sizes at z = 1-2. Blank postage stamps (containing only 
noise) were then cut from random positions in the UDS K- 
band image, and the model galaxies added to them. The iden- 
tical fitting procedure used on the real images was then ap- 
plied to 15000 such simulated postage stamps to derive the 
output effective radii. 

Figure[T]shows the distribution of input and output effective 
radii, and the fractional difference between the two, as a func- 
tion of input effective radius. Brighter galaxies are plotted 
as larger points. As expected, relatively faint galaxies exhibit 
larger random uncertainties in their size measurements. While 
systematic deviations are seen for both the faintest and largest 
galaxies, these are relatively small (with a median of ^ 10%) 
compared to the random errors; additionally, since we only in- 
clude galaxies with z > 0.5 in our analysis, objects with such 
large angular sizes are quite rare. The se observed o ffsets are 
comparable to the offsets reported by iTruiillo et alj (l2006b); 
most importantly, the average effective radii of galaxies with 
sizes 0"1 - 1 "0 appear to be reasonably reliably recovered. 
Given the large scatter, however, it should be noted that in- 
dividual measurements of galaxy sizes are highly uncertain 
even if ensemble averages are accurate. 

2.1.3. Empirical consistency tests 

The simulations described above provide a realistic view of 
some of the random and systematic uncertainties in this analy- 
sis. However, to some extent these are idealized; for example, 
the evident ability to measure effective radii as small as one- 
half of a pixel may be true in a numerical sense, but in real 
images additional effects such as PSF variations will limit the 
precision that can be achieved. We therefore supplement the 
simulations with a set of purely empirical tests to better assess 
the reliability of the galaxy size measurements. Specifically, 
we check the following three important assumptions: (1) the 
simple interpolation of galaxy sizes between different bands 
is robust; (2) a single PSF is sufficient to model the entire 
image; and (3) the fitting box size of 4"2 is large enough to 
obtain reasonable fits. 

First, to test the interpolation assumption, we estimated the 
//-band sizes by averaging the measured J and K sizes; the es- 
timated and measured H sizes matched very well with no no- 
ticeable systematic offset. The "single PSF" assumption (2) 
was tested by generating four different PSFs, each composed 
of bright stars from the four quadrants of the K mosaic, and re- 
measuring the sizes of galaxies within the central 40" x 40" of 
the K mosaic. While there were small systematic differences 
in the sizes measured with the different PSFs, these were on 
the order of 0."02, which is effectively negligible compared to 
typical measured galaxy sizes of 0"2-l"0. Similarly, the me- 
dian galaxy sizes show no evidence of systematic deviations 
with position in the image, indicating that the PSF does not 
significantly vary. Finally, the box-size assumption (3) was 
tested by re-fitting a subset of galaxies using postage stamp 
sizes of 5."2 and 6."2; the new sizes were fully consistent with 
those measured in the original 4. "2 cutouts. The exception to 
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this is the largest objects in this sample with > I'.'l (about 
lOkpc radius at z = 1-2); these objects exhibit 30% larger 
sizes when a larger fitting box is used, but such objects com- 
prise only a tiny fraction (< 4%) of our z > 0.5 sample and so 
do not affect our results. 

Re-fitting the same subset of galaxies using different PSFs 
and box sizes also gives an estimate of the measurement un- 
certainties (i.e., the degree to which a given size measure- 
ment is reproducible given different assumed input parame- 
ters). Between 0."2 and 1"0 the measured sizes were consis- 
tent with each other within ^5%; at 0"! the fractional error 
rapidly increases to about 10%-15%, and at smaller effective 
radii sizes can no longer be reliably inferred. To determine 
the efficiency to which Galfit can distinguish between point 
sources and extended objects in these data, we also measured 
the radii of stars (found via color selection) using the same fit- 
ting parameters. Interestingly, while essentially all of the stars 
have best-fit radii of zero (as is expected for point sources), 
Galfit measures non-zero radii for the vast majority of com- 
pact galaxies - even those with best-fit radii much smaller 
than the PSF. 

Thus, these simulations and empirical tests confirm that ef- 
fective radius measurements are likely free of major system- 
atic effects down to = 0"l-0."2. Even if s ome offsets 
are present (as seen in our simulations and by Truiillo et al.l 
|2006b), they are comparable to or smaller than the system- 
atic uncertainties on other parameters (e.g., in mass, redshift, 
and SFR determinations). We therefore conclude that mea- 
surements of the average sizes of large galaxy samples down 
to sr nall radii are robust given these data. This is similar to 
what iFranx et aTl (120081) found by fitting their ground-based 
data and comparing it to Advanced Camera for Surveys (ACS) 
imaging. 

2.2. Stellar masses and star-formation rates 

The stellar masses of galaxies in this sample were cal- 
culated with the Fitting and Assessment of Synthetic Tem- 
plates (FAST) code (Krieketal. 2009). Thi s code uses 
minimization to fit .Bruzual & Chariot 620031) stellar popula- 
tion evolut ion models to the observed broadband photome- 
try. A Sal peted (1195 5*) initial mass function (IMF) and so- 
lar metallicity were assumed; masses were then shifted by a 
factor of -0.2 dex for consistency with the z ^ Sloan Dig- 
it al Sky Survey (SPSS) masses (which were com puted with 
alKroupal (12001b IMF). We re-fitted a subset with iMarastonI 
(l2005ir models and both IMFs to verify that this factor is 
essentially constant and does not vary systematically with 
galaxy color or mass. Redshifts were fixed to the Zphot val- 
ues derived by EAZY, and a variety of evolutionary histories 
were allowed in the fitting, including exponentially-declining 
(SFR- e"'/^) models with r ranging from 10^ '''Gyr. The 
computed masses are consistent with those calculated using 
other standard methods, for example using HYPERZ as a 
fitting engine and scalin g the model amplitudes to es timate 
mas ses (as done by, e.g., lForster Schreiber et al.ll2004l) . Note 
that lMarastonI (120051) mod els result in galaxy masses approxi- 
mately 0.2 dex lower than iBruzual & Chariot! (120031) models; 
however, unlike the IMF correction factor, this discrepancy 
varies somewhat with galaxy mass. 

Other parameters, such as the degree of dust extinction 
Ay, average stellar population ages, exponential factor r, 
and SFR are also computed by FAST during the SED fitting 
procedure. Most of these are highly uncertain when based 
on broadband data (.Kriek et al.. .2008b.) ; however, the sSFR 
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Fig. 2. — Size-mass relation for galaxies at z ~ 0.06 (SDSS; upper left) and 
three redshift ranges in the UKIDSS UDS. The grayscale denotes the number 
of points within each bin; dashed lines indicate the angular radii {d ~ 0"2) 
below which individual size measurements have larger uncertainties. Mas- 
sive, compact galaxies (i.e. those in the lower right-hand portion of each 
panel) are almost nonexistent at z = but become progressively more com- 
mon at higher redshifts. The radii of galaxies with M > lO'" Mq evolve 
roughly as re ~ (1 +z)"'"*' (see also Table|2j- 

(sSFR=SFR/M,t) is somewhat more robust. This quantity, 
taking the mass of the host galaxy into account, provides a 
more meaningful characterization of star-formation activity 
(e.g., an SFR of lOMo/year is far more significant in a dwarf 
galaxy than a giant elliptical). Furthermore, since the total 
SFR and mass both exhibit a similar dependence on the as- 
sumed IMF, the sSFR is less sensitive to the choice of IMF 
and input stellar population model(s) than the absolute SFR. 

2.3. Rest-frame fluxes 

Rest-frame colors, particularly the combination of U -V 
and V -J, are invaluable for distinguishi ng quiescent galax- 
ies from those actively forming stars dWuvts et al.l 120071 ; 
Williams et al. 2009). Although such colors can be derived 
directly from the templates used for photometric redshift fit- 
ting, by definition these colors are confined to the range of 
colors spanned by the template set (which may be narrower 
than the intrinsic range of the galaxy population). Instead, 
we interpolate rest-frame U, V, and J fluxes directly from 
the observed photometry and photometric redshifts using the 
same method as Williams et al. (2009) but with the new H- 
band d ata included. This is accomplished using the InterRest 
utility (iTavlor et al.ll2009a) . whi ch in turn is an implem enta- 
tion of the method described by iRudnick et alj (l2003h . The 
reddest observed band in our data set is IRAC 4.5/im, corre- 
sponding to rest-frame 7 at z — 2.5; reasonably robust fluxes 
can thus be interpolated up to approximately this redshift. 

2.4. Completeness 

In general, the mass completeness limit of a flux-limited 
galaxy sample depends strongly on color: since red galax- 
ies have higher mass-to-light ratios, mass-limited red galaxy 
samples become incomplete at brighter magnitudes than for 
blue galaxies. Completeness limits derived for strongly het- 
erogeneous (mixed red and blue) samples are therefore dom- 
inated by blue galaxies, and such samples can still exhibit 
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Fig. 3. — Top: Size-mass relation in SDSS (left panel), and in three UDS redshift bins: z = 0.5 -I, z= 1 — 1.5, and z = 1.5 -2 in the second, third, and fourth 
panels respectively. Bottom: Surface density-mass relation in the same redshift bins. In all plots, the color denotes the mean sSFR of the underlying galaxies; 
"faded" bins contain two or fewer galaxies. Dashed lines indicate apparent effective radii of 0"2, below which (or above the corresponding surface density) 
individual galaxy size measurements have relatively large uncertainties (though the binned averages are more reliable). Strong evolution is evident in the mass- 
size and mass-suiface density relations of low-sSFR galaxies, and the compact, dense galaxies which are clearly present in the early universe gradually vanish 
with decreasing redshift. 



serious incompleteness effects among massive red galaxies. 
Computing mass limits from red galaxies alone therefore pro- 
vides a more conservative estimate of the overall sample com- 
pleteness. 

We derive mass completeness limits for the UDS sample 
by comparing the masses of red galaxies (CZ-Vrest > 1-5) to 
their observed K magnitudes. The 75% completeness limit 
is then defined as the mass at which about 25% of the galax- 
ies in the sample fall below the adopted K flux limit. This is 
performed for the 1 .5 < Zphot < 2.0 redshift bin, thus ensuring 
that our analysis does not suffer from significant incomplete- 
ness effects up to z = 2. To ensure accurate size measurements, 
the size and surface density analysis is restricted to galaxies 
with Kiii^ < 22.4 (approximately 1 magnitu de brighter tha n 
the formal 5(7 survey limit, also used in Will iams et alJl20()9l) . 
corresponding to a mass limit of logM^ > 10.6. 

3. THE RELATION BETWEEN MASS, SIZE AND STAR FORMATION 
3.1. Mass-size relation 

Our newly constructed sample allows us to study the mass- 
size relation and its evolution to z = 2. This relation, in three 
redshift bins from the UDS, is plotted in Figure|2] To supple- 
ment the higher-redshift data, the same relation from SDSS at 
z ^ is also shown. 

Two notable trends stand out: first, there is a correlation 
between radius and mass, whereby the most massive galaxies 
out to z = 1.5 have on average larger sizes than less massive 
galaxies; this effect is weaker or nonexistent at z = 1.5-2. A 
second and related point is that there are effectively no mas- 
sive, compact galaxies (M > 10" Mq, r^, < 2kpc; in the lower 



right-hand region of the size-mass plot) at z = 0, but at higher 
redshifts this area of parameter space becomes progressively 
more populated. This simply reflects the strong size evolu- 
tion of massive galaxies found in several previous studies and 
introduced in § 1 . 

3.2. Star formation as a function of mass and size 

iKauffmann etall dlOOl l2006h and iFranx et al.l dlOOSh 
showed that the broad range in galaxy effective radii, at a 
given mass, is tightly coiTelated with sSFR out to z = 2.5. 
These authors concluded that sSFR is a tight function of 
stellar mass surface density (M^//?^), or velocity dispersion 
{^M^jre, Franx et al. 2008). Put differently, these results im- 
ply that the size-mass relation is different for galaxies with 
different sSFRs. 

Our sample is ideally suited to study this aspect at higher 
redshifts, as it covers an area ^18 times larger than that used 
by Franx et al. (2008). We show in Figure|3]how the sSFR de- 
pends on mass and size. It is clear that the dependence is very 
strong: at a given mass, galaxies with low sSFRs (binned and 
plotted as red/yellow squares in this figure) have small sizes, 
while those with large sizes have high sSFRs (green/blue 
squares). The effect is not only strong at very low redshift, 
but extends to the highest redshift bins. At the same time, the 
sizes of galaxies with low sSFRs (red in the figure) are very 
small at high redshift, c onsistent with res u lts obtained by oth- 
ers on smal ler samp les 'Toft e t all (120071): | van Dokkum et all 
(2008); FranjLetaLl C2OO8); To ft et all (120091) 7 

Table [T] lists the best-fit slopes and normalizations of the 
quiescent galaxy size-mass relations seen in Figured For this 
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we defined "quiescent" as galaxies exhibiting SSFR< 0.3 /tn, 
where tn is the age of the universe at each galaxy's redshift, 
effectively picking out the red and yellow points in Figure [3] 
(see also ^and the Appendix). In each redshift interval the 
median log effective radius was calculated in mass bins of 
width 0.2 dex, only including galaxies above the mass com- 
pleteness limit at each redshift, and a least-squares fit was 
performed to these median points. For convenience, the nor- 
malizations of the power-law fits in Table [T] are defined as the 
log effective radius of a typical galaxy with M = 1O"M0. Un- 
certainties were computed using bootstrap resampling. Note 
that any comparisons between the mass-size relations seen in 
SDSS and UDS should be treated with caution, as systematic 
differences in the size and mass determinations between these 
two samples may exist. Nonetheless, strong evolution in the 
normalization of the size-mass relation is evident; the slope of 
the relation, however, does not significantly change between 
z = 0.5-2. 

As the strong mass-size correlation of quiescent galaxies 
implies, sSFR does not depend strictly on size or mass alone, 
but rather on some combination of the two. The bottom pan- 
els of Figure |3] instead show how sSFRs depend on stellar 
mass surface density (E* = M /iTir^) and mass. Galaxies 
with the highest surface densities have weak star formation, 
with very little dependence on mass. In addition, we can de- 
fine a "threshold surface density" between qu iescent and star- 
forming galaxies (similar to that defined by Franx et al."2008') 
at each redshift; in Figure|3] the sharp color division (between 
green and yellow) at log sSFR= -10.0 could be taken as a sim- 
ple example of one such threshold. This clearly increases 
with increasing redshift. Note that this particular threshold 
depends only weakly on mass (i.e. is nearly horizontal in 
the E-mass plot), confirming that surface density is more fun- 
damentally jelated_to_star formation activity in galaxies than 
mass ( Franx et al.ll2008h . 

Furthermore, this figure clearly shows at which epoch 
galaxies of a given size, mass, and density were forming stars: 
for example, galaxies with M = 10" M0 were generally star- 
forming at z ^ 2 but by z = 1 .0- 1 .5 most of their star forma- 
tion activity had ceased. However, the most compact massive 
galaxies (the lower right-hand region of the radius-mass plot) 
are quiescent at all redshifts considered here, and must have 
been quenched at z > 2. 

Most intriguingly, the compact quiescent galaxies visible 
at high redshift do not maintain their small sizes or high 
surface densities to lower redshifts; instead, the quiescent 
galaxies progressively evolve to larger radii and lower sur- 
face densities with decreasing redshift. This phenomenon is 
apparent from the UDS data alone in Figure [3] but is most 
starkly illustrated by the z ^ SDSS panel where essentially 
no massive, compact galaxies are visib le. This re-illustrate s 
the phenomenon po inte d out by .van Dokkumetal.1 (Hooi, 
iTruiillo et al.1 (l2009h . and lTavlor et al.1 (l2009bl) . butwith mul- 
tiple redshift bins the depletion of these galaxies is now easily 
seen. Since these galaxies are unlikely to be losing more than 
a small fraction of their mass (via mass loss from evolved 
stars; Damjanov et al. 2009), size evolution must be the pri- 
mary driver behind their disappearance. 

The trends shown in Figure |3]could in principle be affected 
by certain assumptions we have made. First, a minimum sSFR 
of 10~" yr"' has been imposed as an lower limit to what can 
be measured with SED fitting; however, varying this limit 
only serves to shift the apparent "quiescence threshold" in this 



TABLE 1 

Fits to the quiescent galaxy size-mass 

CORRELATIONS SHOWN IN FIGURe[3] 



Redshift 


A 


b 


SDSS 


0.634 ±0.004 


0.41 ±0.01 


0.5 <z< 1.0 


0.46 ± 0.02 


0.54 ±0.06 


1.0 <z< 1.5 


0.35 ±0.01 


0.56 ±0.06 


1.5 <z<2.0 


0.25 ±0.01 


0.50 ±0.07 



Note. — These fits only include quiescent galaxies (sSFR< 
0.3f/f , where tn is the age of the universe at each redshift (see 
Appendix); red points in Figure |3). Note that there may be 
systematic offsets between the SDSS and high-redshift sizes 
and/or masses. Best-fit parameters are defined as logr^ = 
A + ^;(logM/M0 - 11), so that 10* is the typical radius of a 
10" Mq galaxy in kpc. 

figure, without affecting the overall trends. Again, the abso- 
lute radius, mass, and sSFR values may exhibit some system- 
atic offsets between the SDSS and UDS samples due to dif- 
ferent fitting and calculation methods, but the relative values 
within each frame (and between the three UDS redshift bins) 
are generally consistent. It is also possible that some degen- 
eracies may exist between the mass and sSFR estimates used 
here since both stem from the same template fits (and hence 
aren't entirely independent quantities). To test the robustness 
of these results, we reproduced Figure |3] using the SWIRE 
24/im data in this field as a fully independent sSFR diagnos- 
tic. A relation between the sSFR and fiA^un/ fK (with a minor 
correction using f^lfK to account for low-mass blue galax- 
ies) is presented in Williams et al. (2009, Appendix). Using 
the 24/im data in place of the best-fit SED sSFR does not 
significantly change the results shown in Figure |3] (though it 
is somewhat noisier, due to the shallowness of the SWIRE 
24/im data). 

4. THE structural EVOLUTION OF STAR-FORMING AND 
QUIESCENT GALAXIES 

The strong size and surface density evolution of the "red 
and dead" galaxy population, and the apparent "disappear- 
ance" of the most compact objects, presents an intriguing puz- 
zle for observers and theorists alike. In order to take full ad- 
vantage of the unprecedented sample size of the UDS and pro- 
vide constraints for present and future models, we now turn to 
quantifying these trends. 

Figure |4] shows the sizes and surface densities of > 
]^q10.8 galaxies as a function of redshift. This mass limit 
is equivale nt (given the difference in IMF) to the limits em- 
ployed by Ivan Dokkum et al.l (I2008h and Ivan der Wei et alj 
( 120091) ■ Quiescent galaxies (selected via the "quiescent 
red sequence" metho d described in the Appendix and by 
iWilliams et all l2009l) are shown in red, and star-forming 
galaxies are shown in blue. Ivan der Wei et al. (2009) note that 
any proposed models for the size evolution of quiescent galax- 
ies (in particular those relying on dry mergers) are addition- 
ally constrained by the co-moving mass and/or number densi- 
ties of these galaxies as a function of redshift. The right-hand 
panel of this figure thus shows the number density evolution 
of quiescent galaxies with log(M,t/M0) > 10.8; this was cal- 
culated at z > 0.5 with galaxy counts in three redshift bins 
from the UDS, and at z by integrating the S DSS Value 
Added Galaxy Catalog mass function tabulated by I Yang et alj 
( 120091) . The plotted UDS number densities only include Pois- 
son uncertainties; cosmic variance and selection effects have 
not been taken into account. Since the UDS sample is more 
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Fig. 4. — Left: Evolution of log(M*/MQ) > 10.8 quiescent (red points) and star-forming (hhie points) galaxy effective radii. Here "quiescent galaxies" are 
defined as those falling on the quiescent red sequence illustrated in Figure|6] The large blue and red crosses show the median radii of star-forming and quiescent 
galaxies respectively in bins of A; = 0.25. The dashed fine is a power-law (in 1 +z) fit to the small red points. Small black points near z = 0.06 are from the 
SDSS; note that these points were not included in the power-law fit. Center: Same as the left panel, but for galaxy surface densities. Right: Evolution of the 
massive quiescent galaxy number density with redshift. Error bars for the z = 0.5-2 points do not take cosmic variance or selection uncertainties into account; 
only Poisson errors are included. Here the z ~ point is computed from the SDSS red galaxy mass function of Yang et al. (2009). The observed strong evolution 
since z = 2 in each of the three quantities (size, surface density, and comoving number density) is well-described with a power law in (1 +z)", where a = -1.09, 
2.37, and -1.0 respectively. 



than 99% complete at these somewhat higher masses, no cor- 
rection to the best-fit number densities is necessary. 

The trends seen in these three quantities are quite well- 
fitted with power law functions of the form Y ^ {1 +z)°', 
where a = -1.09, 2.38, and -1.0 for Y = r^, S*, and n re- 
spectively (best-fit a and normalization values are also listed 
in Table for this and other samples). These fits were per- 
formed only to individual galaxies between z = 0.5 and 2.0, 
excluding the SDSS data and points that are major outliers 
in effective radius (logr^ > 1.1 or logr^ < -0.25). However, 
the SDSS data point was included in the fit to the number 



density evolution. For comparison, Ivan der Wei et a 



1ll2009h 



find th at effective radii of early-type galaxies with ISalpeted 
(119551) masses > lO" Mq (equivalent to our mass limit) are 
0.54±0. 04 times smaller at z= 1 than z = and 0.3 ±0.1 times 
smaller at z = 2. In both size and surface density the evolution 
they infer is somewhat slower than what we find; this might 
be a result of different sample selection methods. They also 
derive a comoving mass density equal to 35 ± 13% of the lo- 
cal value at z = 1 and 10^g% at z = 2.4, while our number 
density evolution is somewhat shallower than this (although 
roughly consistent between z = and 1), implying significant 
mass growth in the existing quiescent galaxy population over 
these redshifts. 

The above fits include galaxies spanning a somewhat wide 
range of masses (logM/M© > 10.8), but with this large sam- 
ple it is possible to investigate the growth of galaxies as a 
function of stellar mass. Table [3] lists size evolution power- 
law indices (a^) and normalizations (/?,-) for these same qui- 
escent and star-forming subsamples (as well as the total sam- 
ple) in three narrower mass bins. A strong trend is immedi- 
ately apparent: more massive galaxies undergo significantly 
faster size evolution with redshift. For the total sample 
steepens from -0.52 to -1 .09 between stellar masses of lO'"'^ 
and > 10^' Mq, and from about -0.8 to -1.3 over the same 
mass interval for both the quiescent and star-forming subsam- 
ples. A similar trend was also noted by Franxet al. (2008). 
Although this could in principle be due to systematic galaxy 
mass overestimates at high redshift, such offsets would need 
to be quite large and redshift-dependent (e.g., ^ 0.2-0.3 dex 
too high at z ^ 2, but correct at z ^ 0.5). We therefore con- 
clude that there is strong evidence for differential size evolu- 



tion with mass. 

As noted in the Appendix, a number of other physically- 
meaningful definitions of "quiescence" are possible and may 
be simpler to define within certain models. We thus repeated 
the power law fits to other logM/M0 > 10.8 quiescent galaxy 
samples defined through the other two selection methods de- 
scribed in the Appendix, i.e. based on low best-fit sSFR 
(< 0.3f//) and red rest-frame colors (CZ-Vrest > 1-5). For 
completeness we performed the same fits to the size and sur- 
face density evolution of the complementary samples (star- 
forming and blue), as well as all massive galaxies in this sam- 
ple. Errors on the power-law slopes were computed using 
bootstrap resampling. The results of these fits are tabulated, 
along with the reference "quiescent red sequence" sample, in 
Table |2] The sizes and surface densities of the different sam- 
ples evolve at similar rates (differing by no more than 2a for 
any two "equivalent" samples, e.g. low-sSFR and quiescent 
red sequence). The observed structural evolution thus appears 
to be more or less independent of the exact technique used 
to select quiescent galaxies. Interestingly, massive galaxies 
overall exhibit slower evolution than either of the star-forming 
or "dead" subsamples; this is simply a consequence of the in- 
creasing fraction of massive quiescent galaxies at lower red- 
shifts. 

5. HOW UNIVERSAL IS THE STAR FORMATION-GALAXY 
STRUCTURE CONNECTION? 

From the preceding sections it is clear that star formation 
and galaxy structure are tightly coupled. While it was al- 
ready estabhshed that such correlations exist tiToft et al.ll2007l ; 
iFranx et ani2008i) . the order-of-magnitude larger galaxy sam- 
ple provided by the UDS allows us to not only observe and 
quantitatively measure said correlations, but also to investi- 
gate how uniformly the galaxy population at high redshift is 
described by them. In this final section we present some alter- 
nate tests of the connection between structure and star forma- 
tion, with the specific goal of qualitatively understanding the 
degree to which these relations are "universal." 

5.1. Average Spectral Energy Distributions 

Average SEDs are an invaluable tool for ascertaining both 
the underlying spectral shapes of large galaxy samples and the 
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TABLE 2 

Best-fit power law parameters' for the structural evolution of massive galaxies 



Sample 


Or 












ALL 


-0.88 ±0.06 


0.73 ±0.02 


1.92±0.11 


7.90 ±0.04 






Quiescent red sequence 


-1.09 ±0.08 


0.74 ±0.03 


2.37±0.14 


7.87 ±0.05 


-0.96 ±0.1 3 


-3.44 ±0.07 


Non-(quiescent RS) 


-1.09 ±0.08 


0.88 ±0.03 


2.44 ±0.15 


7.52 ±0.06 






sSFR< 0.3/f« 


-1.17 ±0.07 


0.78 ±0.02 


2.52±0.13 


7.80 ±0.04 


-0.73 ±0.14 


-3.45 ±0.08 


sSFR>0.3/fH 


-0.92 ± 0.09 


0.85 ±0.03 


2.14±0.18 


7.57 ±0.07 






(U-V) > 1.5 


-1.00 ±0.06 


0.75 ±0.02 


2.23±0.12 


7.83 ±0.04 


-0.47 ±0.19 


-3.45 ±0.10 


iU-V) < 1.5 


-1.26±0.19 


0.98 ±0.08 


2.79 ±0.40 


7.30±0.16 







Note. — In all subsamples a mass cut of log(A/*/MQ) > 10.8 has been imposed. The SDSS data were only included in 
the tit for a„ ; the a,- and aj: fits were based solely on UDS data. 



' Power-law parameters are defined such that f(z) = bf(l where / is r^, E*, or n; uncertainties are estimated with 

bootstrap resampling. 



TABLE 3 

Size evolution power-law fits in different mass bins 



All Quiescent Star-forming 

logMt/M0 ar hr ctr b,- a,- b,- 

10.6-10.8 -0.51 ±0.07 0.51 ±0.02 -0.75±0.10 0.44±0.03 -0.77±0.08 0.70±0.03 

10.8-11.0 -0.81 ±0.07 0.66±0.03 -1.06±0.11 0.66±0.04 -1.10±0.10 0.86±0.04 

>11.0 -1.09± 0.09 0.87 ±0.03 -1.30±0.10 0.90±0.03 -1.32±0.15 1.03±0.05 

Note. — Power-law parameters a,- and b,- are defined such that ^(z) = bril+z)"'' ■ Quiescent and star-forming subsamples are selected 
via the "quiescent red sequence" method described in the Appendix. 



scatter (or lack thereof) in their SEDs. To determine whether 
dense galaxies are universally quiescent, we separate all z = 
1-2, K < 22 A UDS galaxies into four surface density bins 
and construct average rest-frame SEDs using their observed, 
de-redshifted photometric data points. Figure |5] shows these 
photometric data (normalized at 8000A) in each bin; median 
and 75% dispersion values of the individual galaxy points in 
bins of rest-fra me waveleng t h are overplotted. To guide the 
eye, four Bruzu aT& Charlog (l2003h stellar population models 
(from bottom to top, one single-burst model with age 1 Gyr, 
and three constant star-forming with increasing levels of dust 
obscuration) are also shown in each panel; note that these are 
not fits to the data. 

This figure provides a nearly model-independent confirma- 
tion of the result shown in Figure [3] namely, that the dens- 
est galaxies exhibit an average spectral shape that is well- 
represented by an evolved stellar population, with remarkably 
low dispersion. Interestingly, there appears to be a smooth 
progression between average surface density and dust proper- 
ties: galaxies with the lowest log E are typically blue and have 
low dust obscuration, but with increasing surface density the 
SEDs become dustier. It is especially notable that this does 
not appear to be caused by the addition of progressively more 
quiescent galaxies to the blue galaxy population; at least in 
the lowest two logS bins, the median SEDs are unambigu- 
ously within the "dusty" regime. Thus, it appears that there 
is not only an anticorrelation between sSFR and surface den- 
sity, but also a positive correlation between dust and surface 
density for star-forming galaxies. However, since the sSFR is 
a mass-normalized quantity and the (qualitative) dust obscu- 
ration seen in Figure |5] is not, we caution that this last point 
may be entirely due to more massive galaxies containing more 
total dust. 

More importantly, the SED dispersion of the highest- 
density galaxies is at least a factor of 2 lower than for less- 
dense galaxies. This strongly implies that the highest density 
galaxies are overwhelmingly represented by a single model 



with relatively few outliers; at lower surface densities, on the 
other hand, galaxies show larger dispersions in their spectra 
and thus span a range of spectral shapes. The comparatively 
low scatter in the high density galaxies further reinforces the 
idea that they are by and large quiescent; if they instead were 
predominantly red due to dust obscuration, they would most 
likely exhibit large dispersions comparable to the SEDs in the 
lower-density bins. A quiescent galaxy model also provides 
a somewhat closer representation of the 4000A break region 
than the Ay = 3 model, but the models underpredict the ob- 
served UV flux in these galaxies, indicating that there may be 
some contribution from very dusty starbursts or low levels of 
residual star formation in the "quiescent" galaxies. 

5.2. Galaxy structure in color-color space 

Up until now we have primarily investigated the star- 
formation (and SED) properties of galaxies as a function of 
structural parameters. We now consider the converse ques- 
tion: what are the structural parameters of galaxies as a func- 
tion of their star-format ion properties? 

Wil liams et alJ(l2009h describe the use of the rest-frame U — 
V vs. V-J (hereafter UVJ) diagram as a powerful diagnostic 
of both star-formation activity and dust obscuration. Figure |6] 
shows this diagram for all z = 1 - 2 galaxies in the UDS. In 
short, quiescent galaxies fall in a discrete clump in the upper 
left-hand region of this plot (above the dashed line), while 
star-forming galaxies form a "dust sequence" extending from 
the lower-left (blue) to upper-right (red) region. The utility of 
this plot as a star-formation diagnostic was confirmed using 
stacked 24/im data; a more thorough discussion is provided 
in the Appendix. 

In each panel of Figure |6] galaxies have been binned and 
color-coded by their median (a) sSFR, (b) mass, (c) effec- 
tive radius, and (d) surface density. In the first panel we con- 
firm with our SED-based sSFR estimates what was previously 
shown bvl Williams et alj ( 120091) with 24/im data - the "quies- 
cent red sequence" (above the dashed diagonal line) is indeed 
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Fig. 5. — Average spectral energy distributions of all K < 22.4 galaxies between z = 1 - 2 in four stellar surface density bins. Small dots represent broadband 
flux mea surements of individual galaxies, thick black bars denote the median and interquartile dispersion of the individual points, and overplotted lines show 
IBruzual & Charloli ^2003.) models of quiescent ( 1 Gyr age; solid Hues) and star-forming galaxies with Ay = i (dotted), 2 (short dashed), and 3 (long dashed). The 
composite SED of the highest surface-density galaxies is well-represented by an old or a very dusty stellar population template. 



overwhelmingly populated by quiescent galaxies, while red 
galaxies that do not lie on this sequence have significant star 
formation, and are thus dusty starbursts. Panels (b) and (c) 
illustrate how most quiescent galaxies have large masses and 
small effective radii, but some systematic variation is clearly 
present (due to the size-mass relation). However, in the final 
panel it is evident that quiescent galaxies almost uniformly ex- 
hibit high surface densities; there appears to be little variation 
in surface density with either mass or size along the quiescent 
red sequence. The converse also appears to hold true: the 
highest-density galaxies almost entirely fall within the "qui- 
escent" region, confirming the result shown in Figure |5] (al- 
though some of the reddest star-forming galaxies exhibit den- 
sities approaching those of quiescent galaxies). 

Though useful for illustrating the utility of the UVJ dia- 
gram. Figure |6] spans a fairly wide redshift range and may 
also contain systematic effects due to incompleteness at low 
masses. We therefore re-plot panels (c-d), the size and sur- 



face density as a function of color, in Figure [7] for four red- 
shift bins. A mass limit of logM* > 10.6 (the 75% red-galaxy 
mass completeness limit at z = 1.5-2) is also imposed; thus, 
this figure is complete only up to z = 2 and we caution that 
the z = 2-2.5 bin likely suffers from some incompleteness 
effects as well as larger uncertainties in the masses and rest- 
frame colors. 



The color-color plots shown in Figure|2]present a wealth of 
information about the structural evolution of massive galax- 
ies, and indeed provide a novel view of the results presented 
thus far. First, at all redshifts, quiescent galaxies (above the 
dashed line in each plot) exhibit small radii and large surface 
densities, though their surface densities are far more uniform 
than their sizes. However, while most massive compact galax- 
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Fig. 6. — Rest-frame color-color plot for galaxies with 1.0 < z < 2.0, 
binned and color-coded by (a) log specific star-formation rate derived through 
SED fitting, (b) log(mass), (c) log(effective radius), and (d) log sur face den- 
sity. The dashed line denotes the quiescent galaxy criterion of Williams et al.l 
(2009). where galaxies above the diagonal belong to the "quiescent red se- 
quence" and below the diagonal are star-forming. 

ies are quiescent, the same is not always true for massive, 
high-surface density galaxies: at z ^ 2 there are many dense, 
star-forming galaxies, but the number of such objects drops 
dramatically with decreasing redshift, suggesting that galax- 
ies become quiescent at progressively lower surface densities 
with time. Even with this effect taken into account, it remains 
clear that the quiescent galaxy population undergoes dramatic 
structural changes from z = 2 to 0, so another mechanism must 
be at work to bring about this transformation. 

6. DISCUSSION 
6.1. Comparison to previous work 

By combining size measurements of a very large K- 
selected galaxy sample with SED fitting to determine SFRs 
and masses, we find that quiescent galaxies at all redshifts 
are far more compact than star-forming galaxies of compa- 
rable mass. Similar resu lts have been found by other au- 
thors at high redshift (e.g.jDaddi et al."2005 UToft et al II2007I: 
iFranx et alj|2008t iBuitrago et al.1 12008; Ci matti et all l2008l) . 
but with the larger sample size considered here, detailed in- 
vestigations into the correlations between galaxy structure and 
star formation (and their evolution with redshift) are now pos- 
sible. As shown in Figure[3] a well-defined mass-size relation 
for quiescent galaxies exists up to at least z = 2, but evolves to 
larger sizes with decreasing redshift. This figure also shows a 
clear anticorrelation between star formation activity and stel- 
lar mass surface density (nearly independent of mass), and 
an evolving surface density "threshold" above which galax- 
ies are predomin antly quiescent, confirming the findings of 
IFranx etalJ (1200 8). 

The effective radii of logM/M0 > 10.8 quiescent galaxies 
in this sample evolve as ^ (l+z)"' ' between 0.5 < z < 2, 
with the power-law index varying only slightly depending on 
the exact definition of quiescence (see Table |2|i; this is in 
excellent agreement with the (1 +z)~''^^^ '^ "upsizing" ob- 
served by IFranx et al] ( 120081) in a smaller sample from z = 
0-3, a nd the factor 4 .3 ib 0.7 size evolution since z = 2.3 
seen by iBuitrapo et alj (12008.) . However, the slope of the 



size evolution varies strongly with mass, such that the most 
massiv e galaxies show the fastest mass evolution (see Ta- 
bleO. iTruiillo et al.l (l2007t) found evidence for similar mass- 
dependent evolution of "disk-like" galaxies between < z < 
1 ; the sample presented here confirms that this result extends 
to higher redshifts and applies to the entire quiescent galaxy 
population. 

The evolution in the UDS sample also agrees well with 
other previous studie s specifically focu sing on early-type 
galaxies; for example. Ivan der Wei et al.l (l200 9) find a factor 
of ~ 2 growth in size from z = 1 to the present and a factor 
of ~ 3 since z = 2, which agrees with the observed evolution 
in our "quiescent red sequence" sample. On the other hand, 
we find all galaxies above logM = 10.8 exhibit size evolution 
wit h a power-law inde x of -0.88 ± 0.06, slightly faster than 
the IFranx etaP (120081) index of -0.71 ±0.07; however, this 
may simply be due to the different redshift ranges considered. 
The total massive galaxy sample grows more slowly in size 
than either quiescent or star-forming subsamples of similar 
mass, highlighting the increasing dominance of "dead" ellip- 
ticals over starbursts at lower redshifts. 

In the highest redshift bin, z = 1.5-2, we see a popula- 
tion of ultra-compact, high-density quiescent galaxies that are 
no longer present in the local universe, or even (for the most 
part) at z = 0.5- 1. Studies with higher-resolution data find 
that the typical sizes of M > lO" Mp^ quiesce nt galaxies at 
z ^ 2.3 are re ^ 1 kpc (Ivan Dokkum et al]|200 8'), and a t z > 2 
no qu iescent galaxies with r^, > 2 kpc are seen ( Tof t et al] 
I2007h . Although 1 kpc is smaller than what we can confi- 
dently measure for individual galaxies, this size is consis- 
tent with our measurements. Figure [3] does, however suggest 
the presence of quiescent galaxies at 1.5 < z < 2 with some- 
what extended radii (r^. = 2-3 kpc). A few such objects are 
als o seen at similar redshif ts (and with space-based imaging) 
by iDamjanov et alJ ((200^; thus, the difference may simply 
be due to mild evolution of these galaxies from z ^ 2.3 to 
z = 1.5. Follow-up observations of candidate extended quies- 
cent galaxies at z ~ 2 would confirm or rule out their exis- 
tence, and perhaps shed some light on the mechanism behind 
the size evolution. 

6.2. What causes the structural evolution? 

The presence of ultra-compact massive galaxies with 
extreme surface densit ies (and velocity dispersions; 
Ivan Dokkum. Kriek. & Fr anx 2009) at z > 2 is puzzling, since 
the prevalence of such objects is already diminished substan- 
tially b y z 1.5 (Franx et al. 2008; Cenarro & Trujillo 2009!; 
ICappel lari et al. 2009) and they effectively no longer exist 
at z = (iTruiillo et al.„2009l; iTavlor et al.ll2009bl) . Major and 
minor mergers, in particular those involving only gas-poor 
galaxies ("dry mergers"), likely play a significant role in both 
the mass and size evolution of thes e objects ( Khochfar & Sil3 
2008; Hopkins et al."2009b; Feldmann et al. 2009). However, 
as suggested by van der Wei et al . (2009), other processes 
are also likely to be important, in particular the quenching of 
progressively larger star-forming galaxies at lower redshift. 
Indeed, this may account for the seemingly incongruous 
result that the quiescent and star-forming samples each 
evolve faster than the overall galaxy population. In other 
words, if the compact, high-surface-density "tail" of the 
star-forming galaxy population is preferentially quenched at 
any given redshift, this will accelerate the apparent evolution 
of both subsamples. 

The strong size growth of both massive star-forming and 
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Fig. 7. — Le/f panel: Rest-frame colors of galaxies in four redshift bins, color coded by median log effective radius. Quiescent galaxies lie above and to the 
left of the dashed hne. Right panel: Same as left, but with points color-coded by log surface density (in logM© /kpc^). A mass limit of logM* > 10.6 has been 
imposed; thus the z = 2-2.5 bin is likely incomplete. Quiescent galaxies have distinctly smaller sizes than star-forming galaxies at all redshifts; however, at z > 1 
there appears to be a population of dense, star-forming galaxies. 



quiescent galaxies by itself suggests that major gas-poor 
("dry") mergers are not the only process driving massive 
galaxies' strong structural evolution. Galaxies which are un- 
dergoing significant star-formation must contain substantial 
amounts of gas, and therefore by definition cannot undergo 
dry mergers. However, the sizes and densities of massive star- 
forming galaxies are seen to evolve at a rate similar to the qui- 
escent galaxies. It is therefore possible that at least one of the 
mechanisms acting on compact quiescent galaxies similarly 
affects star-forming galaxies. 

Another proposed mechanism is adiabatic expansion, 
whereby mass loss from evolved stars decreases the potential 
well depth and therefore increases effective radii as galaxies 
age. As discussed by Damjanov et al. (2009), however, such 
a mechanism does not seem feasible because only a small 
fraction of the total mass is expected to be lost in passively- 
evolving stellar populations between z ^ 1.5 and z = 0. The 
size growth may also be explained by many mi nor mergers 
or the accretion of relatively low-mass satellites. iNaab et aLl 
(|2009) point out that minor mergers are more efficient at in- 
creasing the radii of the primary galaxy than major mergers, 
per unit mass of the secondary galaxies. This scenario is also 
particularly attractive because it would depend only weakly (if 
at all) on whether or not the central galaxy is forming stars, 
and therefore could explain the size growth of both massive 
and quiescent galaxies. In a forthcoming paper we will inves- 
tigate the role of major and minor mergers on massive galaxy 
evolution (R. J. Williams et al., in preparation). 

Although it is yet unclear what causes the observed size 
and surface density evolution of massive galaxies, the results 
presented herein illustrate how large, ground-based, near-lR 
surveys can provide large statistical samples, with sufficiently 
accurate size measurements, for comparison with the variety 
of theoretical models now under development. Upcoming in- 
struments like VISTA and WFC3 will greatly enhance the vol- 
ume and quality of available data; however, the proliferation 
of complementary observational results from existing surveys 
already present an important challenge to, and constraints on, 
these models. 



7. SUMMARY 

By applying accurate size measurements and broadband 
SED fitting to the largest photometric sample of massive 
galaxies from z = 0.5-2 to date, we have investigated in de- 
tail the interplay between star formation, galaxy structure, and 
mass, and its evolution with redshift. Our main conclusions 
are as follows: 

1. Galaxies with low sSFRs follow a well-defined mass- 
size relation up to at least z = 2, and this relation moves 
to larger sizes at lower redshifts. 

2. The anticorrelation between stellar mass surface den- 
sity and star formation activity is much stronger than 
the size-sSFR relation; this confirms the result of 
iFranx et al. (2008) that surface density and star forma- 
tion are tightly connected at all redshifts. 

3. Even with the far larger galaxy sample studied here, the 
densest, most massive galaxies seen at z ^ 2 have es- 
sentially disappeared by z = 0.5 - 1 . 

4. The sizes and surface densities of massive quiescent 
and star-forming galaxies evolve smoothly with time 
(following simple power-law behavior in (1 +z)), with 
more massive galaxies exhibiting faster evolution. 

5. Although galaxies with low surface densities exhibit 
a wide range of dust and star-formation properties, 
at the highest surface densities their SEDs are well- 
represented by a single quiescent galaxy model; thus, 
these "dense" galaxies comprise a relatively homoge- 
neous population. 
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APPENDIX 
DEFINING QUIESCENCE 

The primary goal of this work is to investigate the structural properties and evolution of quiescent galaxies, defined as those 
falling below some sSFR threshold. In practice there are a number of ways to define such a sample, particularly when only 
broadband photometric data are available. In this Appendix we describe the primary selection technique employed for this 
analysis (the UVJ color-color selection described bv I Williams et ah 2009), as well as two alternate methods considered in ^ 
Average SEDs of the quiescent galaxies selected via these three techniques are shown in Figure [8] Note that these three average 
SEDs appear quite similar, suggesting that the three techniques are effective at selecting quiescent galaxies (though we caution 
that very dusty galaxies would not clearly stand out in these plots; see the models in Figure|5]). 



Quiescent Red Sequence 

For the most part, galaxies which have ceased their star formation activity appear red due to their evolved stellar populations, 
and up to z ~ 1.5 or higher (depending on the data quality) form a well-defined "red sequence" when plotted in a color-magnitude 
diagram. However, at higher redshifts progressively larger numbers of dusty starbursts are present, and these often exhibit optical 
colors which mimic "red and de ad" galaxies, making the red sequence methods less reliable (especially with only broadband 
photometric data). As shown bv IWuvts et al.l (l2007h . incorporating rest-frame near-lR data into the analysis makes it possible 
to distinguish between dead and dusty galaxies. In particular, quiescent and star-forming galaxies occupy di s tinct r egions of the 
rest-frame U -V v&.V -J (UVJ) color space; with the UKIDSS UDS and overlapping data, Wil Uams et all (l2009h found that a 
"quiescent red sequence" is clearly visible (and distinct from dusty star-forming galaxies) in the UVJ distribution up to z = 2, and 
criteria based on these colors provide an effective physical basis for separating quiescent from star-forming galaxies. 

Figure |6] (left) shows the U VJ diagram for all galaxies between 1 .0 < z < 2.0 in the UDS with K < 22.4. The quiescent galaxy 
selection criterion defined in I Williams et al.l (i2002) is shown as a dashed line, where quiescent galaxies lie above and to the left 
of the line and star-forming galaxies are below. For reference, the criterion at z = 1 - 2 is: 

(U-V)>0Mx(V-J) + 0A9 (Al) 

At lower redshifts the criterion changes slightly, shifting upwards by 0. 1 dex (i.e. the 0.49 becomes 0.59). Additional constraints 
of U-V > 1.3 and V-J < 1.6 are imposed to prevent too many star-forming galaxies from scattering into the selection region. 

We note that some of the galaxies defined via this method as "passive" may nonetheless still have emission in the mid-lR from 
dusty star formation or active galactic nuclei (AGNs). When we compare this technique to deep 24/im imaging in the Chandra 
Deep Field-South, we find that 4 out of 29 "quiescent red sequence" galaxies at z = 2 have significant 24/im e mission. Hence 
this classification is not a guarantee that a given galaxy has no star formation. On average, however, as shown by IWilliams et al.l 
(|2009) for this field, the 24/^m emission is very low for these galaxies. 

This empirical "quiescent red sequence" method is preferred to, e.g., SED fitting because photometric redshifts (as well as inter- 
polated rest-frame colors based on these redshifts and observed fluxes) are typically the best-constrained parameter in broadband 
SED fitting. By providing a directly empirical criterion, the rest-frame color separation is also less subject to template-dependent 
systematics that may affect SED-based sSFRs. We therefore adopt this color-color cut as our primary method for selecting 
quiescent galaxy samples. 

Evolving sSFR cut 

Since the UVJ technique is relatively new and some theoretical models may better predict SFRs, we consider a sample (listed 
as sSFR< 0.3 /tff in Table |2]l based on an evolving cut in galaxy sSFRs. In this case, quiescent galaxies are defined as those 
exhibiting sSFRs less than Q3ltH{Zi), w here tnizj) is the a ge o f the universe for ea ch galaxy at redshift z,-, analogous to the 
quiescent galaxy definitions employed bv lFranx et al.l (l2008h and lFontana et al.l (120091) . The factor 0.3 is somewhat arbitrary, but 
is comparable to the sSFRs of red sequence galaxies in our SDSS subsample. 

Red galaxies 

Finally, one of the simplest criteria that can be defined is a single rest-frame color cut, i.e. assuming that all galaxies redder 
than a certain value are quiescent. As noted before this method is somewhat flawed because sufficiently dusty starburst galaxies 
can mimic the colors of truly "red and dead" galaxies. Nonetheless, for completeness we construct a third sample with rest-frame 
U-V > 1.5 irrespective of sSFR. 
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